Taiwan Hakka Languages and TWHK_ToBI Annotation Conventions
نویسندگان
چکیده
This paper proposes a preliminary prosodic annotation system for Taiwan Hakka, “Taiwan Hakka Tones and Break Indices” is called TWHK_ToBI. TWHK_ToBI includes five tiers: ortho, words, tones, breaks, and miscellaneous. The ortho tier contains Romanization of each syllable and dictionary-defined tones; the words tier includes alphabetized SAMPA spellings of each word; the tones tier includes the sandhi tones for each syllable; the breaks tier indicates degree of juncture including words, fused words, intermediate phrase and intonational phrase boundaries; and the miscellaneous tier labels events such as code switching, laugh, and cough.
منابع مشابه
Toward Constructing A Multilingual Speech Corpus for Taiwanese (Min-nan), Hakka, and Mandarin Chinese
The Formosa speech database (ForSDat) is a multilingual speech corpus collected at Chang Gung University and sponsored by the National Science Council of Taiwan. It is expected that a multilingual speech corpus will be collected, covering the three most frequently used languages in Taiwan: Taiwanese (Min-nan), Hakka, and Mandarin. This 3-year project has the goal of collecting a phonetically ab...
متن کاملMultilingual Speech Corpora for TTS System Development
In this paper, four speech corpora collected in the Speech Lab of NCTU in recent years are discussed. They include a Mandarin treebank speech corpus, a Min-Nan speech corpus, a Hakka speech corpus, and a Chinese-English mixed speech corpus. Currently, they are used separately to develop a corpus-based Mandarin TTS system, a Min-Nan TTS system, a Hakka TTS system, and a Chinese-English bilingual...
متن کاملConstruct a multi-lingual speech corpus in taiwan with extracting phonetically balanced articles
In this paper, we describe an initial stage to construct a multilingual speech corpus in Taiwan with selecting phonetically balanced scripts. It is expected to collect a multilingual speech corpus covering three most frequently used languages in Taiwan, including Taiwanese (Min-nan), Hakka, and Mandarin Chinese. To achieve the objective, constructing a multilingual phonetic alphabet, namely For...
متن کاملThe Effects of Lexical Tones and Nasal Coda /-n/ to Sadness in Taiwan Hakka
This paper concerns the relation between the emotion of sadness & lexical tone types, and the relation between the emotion of sadness & nasal coda /-n/ for non-Hakka speakers with Hakka stimuli. We try to probe what factors cause non-Hakka speakers to receive the expression of sadness in Hakka language successfully. The results showed that in both level tones and checked tone, the average f0 va...
متن کاملScrub Typhus and Comparisons of Four Main Ethnic Communities in Taiwan in 2004 versus 2008 Using Geographically Weighted Regression
PURPOSE On the main island of Taiwan, a higher risk of scrub typhus infection has been reported in endemic clusters in Southeastern Taiwan and in mountainous township areas. However, research on health care problems associated with scrub typhus in Taiwanese ethnic peoples is limited. This study employs spatial analysis of areal data to determine spatial features related to scrub typhus and the ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2011